The Evaluation of Similarity Metrics in Collaborative Filtering Recommenders
نویسندگان
چکیده
Collaborative filtering recommenders predict user opinions regardless of item content. Memory-based implementations generate recommendations on demand, without any pre-computation. Even though such algorithms are slow, they are known for their superior personalization. We study collaborative filtering recommendations on the Netflix dataset. It is essentially a set of (user id, item id, rating) tuples that fit perfectly into a relational database model, with standard data manipulation operations. The simple structure of our system allows consistent and repeatable measurements. Additionally, it natively supports multi-threaded processing, which helps address the inherent performance drawbacks of our approach. We evaluate multiple similarity measures in a traditional collaborative filtering process. We also consider combinations of complementary measures, especially in edge cases when one of them falls short, e.g., a user with uniform ratings. We examine prediction accuracy, classification accuracy, confusion statistics, and actual/predicted distribution compatibility to find the best way to quantify vector similarity.
منابع مشابه
Use of Semantic Similarity and Web Usage Mining to Alleviate the Drawbacks of User-Based Collaborative Filtering Recommender Systems
One of the most famous methods for recommendation is user-based Collaborative Filtering (CF). This system compares active user’s items rating with historical rating records of other users to find similar users and recommending items which seems interesting to these similar users and have not been rated by the active user. As a way of computing recommendations, the ultimate goal of the user-ba...
متن کاملA New Similarity Measure Based on Item Proximity and Closeness for Collaborative Filtering Recommendation
Recommender systems utilize information retrieval and machine learning techniques for filtering information and can predict whether a user would like an unseen item. User similarity measurement plays an important role in collaborative filtering based recommender systems. In order to improve accuracy of traditional user based collaborative filtering techniques under new user cold-start problem a...
متن کاملA NOVEL FUZZY-BASED SIMILARITY MEASURE FOR COLLABORATIVE FILTERING TO ALLEVIATE THE SPARSITY PROBLEM
Memory-based collaborative filtering is the most popular approach to build recommender systems. Despite its success in many applications, it still suffers from several major limitations, including data sparsity. Sparse data affect the quality of the user similarity measurement and consequently the quality of the recommender system. In this paper, we propose a novel user similarity measure based...
متن کاملReco – Experimental Personalized Recommendation Framework
The intensive research in the personalized recommendation area results into the need for the automatizing routine processes within the recommenders’ design or evaluation. In this paper we propose a novel framework for evaluation and experimentation with recommenders. Proposed approach supports basic recommenders’ types – content based and collaborative approaches and allows researchers to add n...
متن کاملA comparative study of heterogeneous item recommendations in social systems
While recommendation approaches exploiting different input sources have started to proliferate in the literature, an explicit study of the effect of the combination of heterogeneous inputs is still missing. On the other hand, in this context there are sides to recommendation quality requiring further characterisation and methodological research –a gap that is acknowledged in the field. We prese...
متن کامل